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I,   TiTTRODUCTION 

Rooted  binary  trees  with  weighted  nodes  are  structures  encountered  in  many 
areas ,  such  as  coding  theory,  searching  and  sorting,  information  storage  and 
retrieval. 

A  common  quantity  of  great  importance  is  the  weighted  path  length.   An 
estimate  of  it  gives  indications  about  the  expected  time  of  a  search  or  the  length 
of  a  code,  for  example.   The  knowledge  of  lower  and  upper  bounds  would  permit  such 
estimates. 

The  noiseless  coding  theorem  in  Information  Theory  provides  a  lower 
bound  for  the  weighted  root-leaf  path  length.   But/  until  recently,  upper  bounds 
which  are  the  more  meaninfgul  for  many  applications,  were  still  lacking.   They 
were  first  obtained  for  unweighted  trees,  introducing  the  concept  of  structural 
balance  of  a  tree  [1].   Then  upper  bounds  for  various  weighted  trees  have  been 
derived  \ising  structural  balance  and  the  new  concept  of  weight  balance  of  a  tree  [2]. 

In  this  paper,  using  a  different  definition  of  the  weight  balance  of  a 
tree,  we  derive  two  upper  bounds  for  the  total  path  length  of  general  weighted 
node  trees. 

The  first  one  introduces  two  parameters  y   and  p  which  sharpen  the  bound 
but  complicate  its  expression.   The  parameters  y   will  be  useful  every  time  we  have 
a  certain  knowledge  of  the  weight  distribution.   However,  if  the  estimation  of  7 
or  (3  requires  too  much  computation,  we  can  take  them  equal  to  zero  and  derive 
from  this  upper  bound  a  simpler  one. 

The  second  upper  bound,  using  a  different  normalization,  will  be  useful 
when  the  entropy  of  the  weights  of  the  nodes  is  not  known. 


II.   DEFINITIONS  AKD  NOTATION 

A  binary  tree  T  is  a  finite  set  of  n  nodes  which  is  either  empty 
(if  n  =  0)  or  else  is  partitioned  into  the  following  three  classes:  A'  single  node 
r  called  the  root  of  T  ,  a  binary  tree  T  on  the  g  nodes  1,    .  .  ,   r-1  called  the  left 
subtree  of  the  root  and  a  binary  tree  T,  on  the  d  nodes  r+1,  .  .  ,   n   called  the  right 
subtree  of  the  root  (r  -  g+1,  g  +  d  +  1  =  n). 

The  subscripts  g  and  d-  will  always  refer  respectively  to  the  left  and 
right  subtrees  of  T  .   In  the  above  definition  they  have  two  meanings.   They 
indicate  both  subtrees  and  number  of  nodes. 

A  weighted  binary  tree  is  a  binary  tree  T  such  that  a  non-negative  real 

number  w ,    called  a  weight,  is  assigned  to  each  node  k. of  T  .   We  denote  a  weighted 
k.  n 

binary  tree  of  n  nodes  by  the  (n+l)-tuple  (T  ;  w .,    ..,   w  ). 

W,  W  ,  W,  will  denote  respectively  the  sum  of  the  weights  of  T  ,  T  , 
'   g   d  *  D       n'   g' 

and  T,. 

d 

Weight  distribution.   We  will  restrict  the  weight  distribution  to  the 
following  case:  At  least  one  of  the  two  sons  of  each  non- terminal  node  must  have 
a  strictly  positive  weight. 

The  weighted  path  length  |t  j_  of  a  tree  Tn  is  defined  as  the  sum  over  all 
the  nodes  of  the  product  of  the  weight  of  the  node  and  the  level  of  the  node.   The 
weighted  path  length  satisfies  the  following  equalities : 

bil  =o 

It  I  =  It  I  +  It  J  +  w  +  w.   (n  >  1) 

1  n1    '  g1    '  d1    g    d 

Weighted  root  balance  p(T  ).   The  two  upper  bounds  depend  on  a  parameter 
which  measures  the  "balance"  of  a  tree,  in  the  sense  of  how  close  the  total  weights 
of  the  left  and  right  subtrees  of  the  root  are  to  each  other.   The  following 
quantity: 


1  Wrr        Wd 

P(T  )  =  —  if  n  -  1;  otherwise,  p  (T  )  =  min(  r-r —  ,  77 )  serves  partially 

v  11    2  '    n        W-w  '  W-w  '  *  J 

r  1   r 
this  purpose.   This  definition  implies  0  <  p  (T  )  <  t.   According  to  the  restriction 

made  previously  about  the  weight  distribution,  p(T  )  =0  whenever  T  has  a  right 

or  left  subtree  of  one  node,  the  weight  of  which  is  equal  to  zero.   (W  •  W  =  0). 

We  will  see  later  that  such  a  value  must  be  avoided  because  it  would  give  a  bad 

estimate  of  Ihe  weighted  balance  of  T  .   We  notice  that  the  weighted  path  length 

It  I  remains  the  same  if  the  weights  of  the  two  sons  of  T  are  interchanged.   Let 
1  n1  n  ° 

w  denote  the  positive  weight  of  the  two  sons  of  the  root  of  T  when  the  condition 

W  •  W ,  ~  0   arises,  then  the  following  definition: 
g    d  ' 

p(T  )  =  yz     if  n  =  1,  otherwise 
n    2 

W     W^  " 
if  W  ■  W A  f   0,  then  p(T  )  =  min(-J 


"d  r  ~f    —      Mv-n/  -  «-« vw_w  >   w_w   / 

r      r 

Wo-  j.  W*  -  w    ¥ 


Wrr  +  «fl 

if  W  •  W,  -  0,  then  p(T  )  =  minC-^—r — = 


,    „d  -  «,  w„™  M\*n/  -  —v   w_w      ;  w_w 

r  1 


makes  the  weighted  root  balance  strictly  positive,  except  for  the  weighted  binary 
trees  of  3  nodes  when  one  son  of  the  root  has  a  weight  equal  to  zero. 

The  weighted  balance  (3(T  _)_  is  equal  to  —  if  n  =  1  or  n  =  3,  otherwise 
p(T  )  =min[P(T  ),  p(Tr),  p(T ,)].   Although  the  weighted  root  balance  can  be 
equal  to  zero  for  trees  of  three  nodes,  we  notice  that  the  weighted  balance 
(3(T  )  for  n  >  1  is  always  strictly  positive.   We  deduce  from  this  definition 
the  following  inequality. 

for  n  =  1  or  n  -  3:  0  <  p(T  )  <  p(T  )■  <  i  . 

•      '  v  n  —    n  —  2 

Terminal  weighted  balance   (3rn(T   )  : 

[3m(T   )    =  p(T   )   if  n   =  1  or  3,    otherwise: 
T     n  ii  y* 

PT(Tn)   -mnLpT(T  ),  PT(Td)] 


this  definition  implies  that  0  <  p  (T  )  <  -  .   This  parameter  appears  in  the 
first  upper  bound.   Its  evaluation  is  easy  and  it  sharpens  the  bound  significantly. 
In  particular,  it  makes  the  bound  equal  to  the  weighted  path  length  for  trees  of 
one  or  three  nodes.   However,  in  any  case,  if  we  don't  want  to  estimate  it  we  can 
take  it  equal  to  zero. 

be I initio a  of  7 :   This  parameter  appears  also  in  the  first  upper  bound. 
It  can  be  verified  easily  that  its  value  does  not  matter  for  trees  of  one  or 
three  nodes.   Moreover,  the  bound  is  equal  to  the  path  length  for  every  value  of 
7.   Therefore,  the  value  of  7  needs  only  to  be  estimated  when  n  >  3. 


if  n  =  3    7(T5)  =  5(Tj 


otherwise  y(l   )  =  min(5(T  ),  j  (l   ),  /(T,))- 

with      5(t  )  =  (a  +  1)  log (a  +  l)  -  a  log  a 

W  -  wr 

and  where  a   =  

w 
r 

The  assumption  made  previously  about  the  weight  distribtution  implies  that  for 

n  -*  3 j   7(T  )  has  always  a  finite  value,  even  if  w  =  0. 

The  expression  of  the  first  upper-bound  shows  that  the  parameter  7  will 

sharpen  strongly  the  bound  if  its  value  is  high.   However,  this  parameter  will  be 

useful  only  whenever  we  have  a  sufficient  knowledge  of  the  weight  distribution  of 

T  ,  because  its  estimation  implies  some  computation.   Nevertheless,  7  as  well  as 

(3 „,  can  in  any  case  be  taken  equal  to  zero.   This  corresponds  to  the  fact  that  if 

n  >  1,  then  W  -  w  >  0. 
—  r  — 

Entropy  H(x).   We  will  introduce  the  following  quantity  in  the  two 
bounds : 

H(x)  =  -[x  log  x  +  (1-x)  log(l-x)]  for  0  <  x  <  1. 


L  or  L(T  )  will  denote  the  sum  of  the  weights  of  the  terminal  nodes  of 
^— n- 


All  the  logarithms  in  this  paper  are  taken  to  the  "base  two. 


III.       TWO  UPPFR   BO  l,;rs   FOR  T;HT:  t-O'I  TGTITI'ID  1'ATH   LENGT:T   OF   BINARY  TRI'IES 

1.  Inequality  Uscd 

Let  {x,  .  x_,  . . ,  x  }  be  a  set  of  n  non-negative  real  numbers  such 
1'   2'   '     n 
n 

that  S  =  Z  x.  >  1.   Let  x,  denote  any  one  of  these  n  numbers  and  a.     a  real 

1=1 

positive  number  such  that  S  >  (l  +  OL   )x,  .   Then,  we  have  the  following  inequality: 

(l)   (S-x.  )  log  (S-X.  )  <  S  log  S  -  x^  log  x,  -  8X   where 

5 =  (o^  +  1)  log  (c^  +  1)  -  ak  log  a-. 

Proof :   Let  f  (x)  =  x  log  x  and  a  and  b'  be  two  real  numbers'  such  that  1  <  a, 

0  <  b.   Then  we  have: 

f(l  +  b)  -  f(l)  <  f(a  +  b)  -  f(a). 

S  -  xv        "^k 
Applying  this  relation  with  a  =  — " — -  ,   b  = ,   we  obtain  the  previous 

^^         w 

inequality  (l). 

2.  First  Upper  Bound 

Let    (T   j  W,  f    • . ,  w   )   be   a  weighted  binary  tree,    then  the  weighted 
path  length    |t    |    satisfies  the  following  inequality: 


TJ    <    IM      [(W-Wr}   los(W-\)   +(     £     V°«^     -     V°S  h      ~7    (¥"Wr)] 


•i-    (/   +  1  -  H(pT))L 


where  p,  p  ,  7,   L  have  the  definitions  given  in  (il) 


Proof:   Case  (i).   The  weight  distribution  verifies  the  restriction  introduced 
previously  and  wc  assume  that  if  v  /  0  then  w,  >   1  for  all  k. 

)   For  n  =1,  W  =w  and  L(T,  )  =  0,  the  assertion  is  true.   We  can 


a 
also  write : 


(2)   |Tn  |  <  ~~  [W  log  W  +  w  log  i-  -  7  w .  ]  +  (7  +  1  -  H(i))  W, 
1  -  H(i}  1^1  2    1 

rfhis  inequality  which  holds  for  every  value  of  7  will  he  useful  in  the  following 
part  of  the  proof. 

b)  Assume  that  the  assertion  is  true  for  all  i  <  n  and  let 

(T  ;  w'   ...  w'),  (T-,;  w"   ...  w")  be  the  left  and  right  subtrees  of  T  .   Hence 

G       J>  (J  d       I  d  n 

w'    =  w,    for  1  <  k  <  r-1  and  w."   =  wn         for  1  <  k  <  n-r        (g   =  r-1  and  d  =  n-r). 
k         k  —       —  k    .     k+r  —       — 


Let  [irr,    Pm   ;   7 m\    $A,   Prp   ,  7 A  be  respectively  the  parameters  of  T„  and  T^   defined 
g 


>,)      Prp      ;      /       •       p^,      Prp     ,      f   ,      W     IC^^Ui»Ci.jf       W«     pal  amc  uci  fi     yjx      X         CMAU     X^ 


in  (il).   Then,  according  to  the  relation  T   =  T   +  T,  +  W  +  W,  we  write 
/  '  '  n1    '  g1    '  d1    g    d 

1  g         l  1 

|TJ  ^  htp-j  [we  -  wP  io«  (ws  -  w;> +  4  k  ioe  ^-  -  w;loe  sj  -  Wvr> 

+  HIT^T     [Wd  -  WP  l0E(Wd  -  wr>  +    *    wk  l0«  V  "Wr  los  5"  -  ^Vr)] 


+  (7   +  1  -  H(p,_  ))  L  +  {7  ,  +  1  -  H(pm  ))  L,  +  W  +  W . 
g 


Using  the  equality  L  =  L  +  L ,  and  the  three  relations 


p(Tn)  =min[p(Tn),  pg,  pd],  PT(Tn)  =  min[pT  ,  PT  ],  7^)  »nln[6(Tn),  7g,  7(i] 

defined  in  (il),  we  write: 

.      ,          1  g                      1 

T       <  ttt^n    [W     -  w')   log(W     -  w')  +    (W.   -  w")  log(W,   -  w")   +     Z     w/    log  -4 

1    n1   -  H(p)        g         r'              Z         *"'        x    d         ry  eN    d         r'                  k       &  w ' 

d  1 

+  £     w£  log  ^r  -  7(Wg   -  WjJ  +  Wd  -  wp]  +    (7   +   1  -  H(PT))L  +  Wg  +  W^ 


If  we  assume  that  T     and  T     have  both  more  than  one  node   (g  >  1  and  d  >  l)   then 

S  d 

W     -  w'  >  1  and  W     -  w"  >  1.      We  can  apply  the  inequality  (l)   defined  at  the 
£>         j  u.         r  — 

beginning  of  (ill).      Moreover,  using  the  relation  7    -  min[s(T   ),  7(T   ),  7 (T   )  ], 

II  £2  U. 

we  obtain  the  pair  of  inequalities: 


(3)      0-r     -  V)   log    (V.T     -  w')     <  W     log  W     +  w'    log  — 7w' 

g         r  g         r       -    g  g         r       b  w'         7    r 


(W,   -  w")   log    (W_    -  w")  <  W.  log  W.  +  v"  log  -4-  -  yw" 
d         r  d         r/  —     d  d         r       b  w  '    r 

r 

We  noiA  obtain  the   following   inequality: 

,       ,           1                                                                  n                  l  1 

(1+)        T       <  —^    \M     log  W     +  W,  log  W     +     Z     v,  log  —  -  v  log  — 

1    n1   -  H(p)    Lg       tog          d       °d       ,•■         k  ve>  w,  r     &  w 

k=l                  k  r 


-  7(W-Wr)l  +    (7   v-  1   -  H(pT))L  +  Wg  +  Wd 


If,   however,    T     and  T     are   such  that  g   ~  1  or  d  =  1,  we  can't  apply  inequality 

g  M- 

(l)  because  W  -  w'  or  W,  -  w"  is  equal  to  zero.   We  will  use  instead  the 
g    r     d    r 

expression  (2)  derived  at  the  "beginning  of  the  proof  when  n  =  1.   Assume  that 

T  and  T,  are  such  that  g  >  1  and  d  =  1,  then 
g      d 

lTdl  <  STO  [wa  loe  wd  +  vr  l04- "  V^  +  (7d  +  1  -  H<PT JK 

VKd  r  d 

where  B,    =  Bm     =    w     ,7-,  having  any  value,   therefore,   after  similar  steps  as 
a        T ,        2     '     a  »     'v 

d 

before,  we   obtain: 

|T    !      <     tt^tx    [W      -  w')   log(W     -  w')   +  W^   log  W ,  +       Z  w.1    log  -7-  -  w'    log  -i 
1    n1     -    H[p)       g         ?'       ^v   g         ry  d       &     d         k=l  k         r  wr 

+  w"  log     irr     -  y(W     -  w«    -  w")]  +    (7  +  1  -  H(bJ)L  +  W     +  W , 
r  w  g         r         r  \Krp/;  g         ^ 

If  now  we  apply  the  first  of  the  pair  of  inequalities  (2)  we  obtain  the  inequality 
(if)  already  derived. 

Hence  for  every  left  and  right  subtree,  we  have: 

|TrJ  £  hW  fWC  1CS  Wg  +  Kd  l0G  Wd  +  *\   loe  r  "  "r  l0S  5"  "  r(W"Wr)] 


k-1        k  r 


+  (7  +  1  -  H(PT))L  +  W  +  Wd. 


The  quantity  W     log  W     +  W     log  W     can  "be  expressed  in  terms  of  the  weighted  root 


"balance  p  (T   ) 

W     log  W     +  W.,   log  W, 

g  C  d  ci 

b  p  (W     +  Wd)    [log(W     +  Wd)  +  log  p  ]  +    (1  -  p  )  (W     +  Wd)  [log(l-  p)   +  log(Wg  +  Wd)  ] 

=  (W-wJ   log    (W-w.)   +    (W-w.  )[  plog  p    +    (1  -  p  )   log    (1  -  p  )] 

p  log(  — )  +  (l  -  p  )  log  U — )  is  the  entropy  of  the  weighted  root  balance  of  T  . 
Then  if  we  recall  that  0  <  p  <  P  <  -,  we  have  0  <  H(p)  <  H(  p)  <  1. 

1                     n  1  1 

|T  I  <  TT/^\  [W-w  )  log(W-"W  )  +   2  w.  log w  log y(W-w  )] 

1  n'  -  11(g)     ry   oV   r'          ,  .  Is.  B  w1  r  &  w        ry 

VK/                   k=l  k  r 

+  (7  +  1  -  H(PT))L+  (1  -§[$  )  (W-wr) 

Hence  the  induction  hypothesis  is  verified.   The  weighted  path  length  |T  | 

satisfies  the  following  inequality: 

.   .    1  n  1  1 

T   <  rrr^T  [(W-w  )  log  (W-w  )  +  Z  w,  log  —  -  w  log  —  -  7  (W-w  )] 

1  n  —  H  (p  J      r         r    .  .   k  w,     r  w        r ' 

K— J-         K  r 

+  (y  +  1  -  H(pT))L 

Case  (ii):   In  the  general  case,  let  w,  =  —   where  w  .   is  the 

k    w  .         min 
min 

minimum  of  all  the  positive  weights.  .  Then  w  >  1  for  all  k  such  that  w  j=   0. 
The  result  obtained  in  case  (i),  applied  to  this  new  weight  distribution,  gives 
immediately  the  desired  result. 

Remark :  Except  for  particular  distributions  of  the  weights  like  a  descending  one, 
for  example,  it  is  difficult  to  obtain  an  upper  bound  which  is  both  sharp  and 
simple.   Jf  we  don't  want  to  .introduce  the  parameters  y   and  (3  ,  we  can  set  both 


of  them  equal  to  zero.,  This  leads  us  to  the  simpler  bound: 

n 

-y  [W-W  )  log  (W-W  )  f   Z       W   log  - 

k  r 


1-1  n  1  1 

V   --  HlTiJ   D'MV}   ;i°o(W-Wr)    f     2     wk  log  —     -  wp  log  ~-    ]  +  L 

k=i  k  r 


10 


3-   Second  Upper  round. 

"n 
Although  the  entropy   Z  w,  log  —  is  a  natural  quantity,  it  may  not 

k=l        k 
always  be  known.   For  such  cases,  the  following  bound  would  be  useful. 

i    I      T_  W-Wv> 

|Tnl  <  H(pT  [(W'V  1CS  (^  "  2  (W"wr)]  +  2L 

mm 

Wmin  iG  the  I,ilnil;;a;i  of  s11   thcpositive  weights  of  T  .   The  proof  is  quite  similar. 
We  would  have  the  following  substitutions  in  part  (ill). 

Inequality  Us e d : 
(1)   (S  -  x^)   log  (S  -  xk)  <  S  log  S  -  2  x^ 


s>V\=0or\^ 


Proof:  case  (i) 
(2)   IT-J  <  i^i *  [W  log  ¥  -  2  WXJ  +  2  W 


(3)   (Wg  -  wp  log  (Wg  -  Wp  <  Wg  log  Wg  -  2  w; 


(Wd  -  wp  log  (Wd  -  vp   <  Wd  log  Wd  -  2  w^ 


M  K\    <     |jgj  [\  log  Wg  +  Wd  log  wd  .  2  (Wg  +  Wd)]  +  2  Lg  +  2  Ld  +  Wg  +  Wd 
case  (ii)  remains  the  same. 
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